Systemic delay propagation in the US airport network 



Pablo Fleurquin 

Instituto de Fisica Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), 
07122 Palma de Mallorca, Spain, 
Innaxis Foundation & Research Institute, 
Jose Ortega y Gasset 20, 28006 Madrid, Spain 

Jose J. RamasccQ and Victor M. Eguiluz 
Instituto de Fisica Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), 
07122 Palma de Mallorca, Spain 

Technologically driven transport systems are characterized by a networked structure connecting 
operation centers and by a dynamics ruled by pre-established schedules. Schedules impose serious 
constraints on the timing of the operations, condition the allocation of resources and define a baseline 
to assess system performance. Here we study the performance of an air transportation system in 
terms of delays. Technical, operational or meteorological issues affecting some flights give rise to 
primary delays. When operations continue, such delays can propagate, magnify and eventually 
involve a significant part of the network. We define metrics able to quantify the level of network 
congestion and introduce a model that reproduces the delay propagation patterns observed in the 
U.S. performance data. Our results indicate that there is a non-negligible risk of systemic instability 
even under normal operating conditions. We also identify passenger and crew connectivity as the 
most relevant internal factor contributing to delay spreading. 



I. INTRODUCTION 

Air transportation systems have been traditionally de- 
scribed as graphs with vertices representing airports and 
edges direct flights during a fixed time period [1, 2 . 
These graphs are called airport networks and have been 
studied at different geographical resolution scales, re- 
stricted, for instance, to a single country (usually the U.S. 
(US AN) [H] but also China [7] or Europe [8 ), or for the 
whole world (WAN) [TJ[2]. These networks show high het- 
erogeneity in the distribution of connections per airport 
and in the traffic sustained by each connection. A non 
linear relation between the number of connections of the 
airports (topology) and the number of passengers (traf- 
fic) has been observed in Ref. pQ and used later for mod- 
elling [9 . Furthermore, airport networks are structured 
in clusters of highly interconnected airports that reflect 
the geographical areas in which the traffic is naturally 
divided [10]. The dynamics of the connections and the 
traffic levels have been also analyzed for the US AN [4]. 
All of these are aspects of the graphs that influence their 
capability to transport persons, goods and even other 
less desirable passengers. For example the propagation 
of infectious diseases at a global scale that occurs when 
infected persons travel across the network [TTHT5] . The 
modelling and forecasting of disease spreading patterns 
using air traffic data is a story of a notable success [T3l - 
[T5] . One can, thus, wonder if this success can be extended 
to the propagation of other phenomena. In particular, we 
are interested in considering here flight delays and the 
way in which congestion can become a systemic risk. 



According to the 2008 Report of the Congress Joint 
Economic Committee, flight delays have an economic im- 
pact in the U.S. equivalent to 40.7 billions of dollars 
per year [15] , while a similar cost is expected in Eu- 
rope p~7j [18]. The situation can turn even grimmer in 
the next decade since the air traffic is envisaged to in- 
crease [T5HT9] . Delays damage companies' balances due 
to enhanced operation costs contributing to deteriorate 
their image with costumers [20]. Passengers suffer a loss 
of time, even more acute in case of missing connections, 
that translates into decreased productivity, missed busi- 
ness opportunities or leisure activities. Additionally, at- 
tempts to recover delays lead to excess fuel consump- 
tion and larger CO2 emissions. As a consequence of this 
challenging situation, a considerable effort has been in- 
vested in the area of Air Traffic Management to charac- 
terize the sources of initial (primary) delays [2TJ [22] and 
the way in which they may be transferred and ampli- 
fied by consequent operations, the so-called reactionary 
delays [19, 23-27 . The concept of delay itself implies 
a time difference with respect to the baseline provided 
by a predefined schedule [2TJ [24]. The propagation of 
delays thus corresponds to the spreading of a malfunc- 
tion across the system. The mechanisms responsible for 
it reflect the complexity of air traffic operations. Apart 
from the airport networks structure and dynamics, other 
factors contributing to the delay propagation are airport 
congestion [25] , plane rotation or crew and passenger con- 
nection disruptions [23j [25J [27] . Airline schedules typi- 
cally include a buffer time to deal with all these issues. 
However, when this time is not enough, the departure of 
the next flight gets delayed and can affect further opera- 
tions in a cascade-like effect [23] . There has been several 
attempts to model delay spreading [28-33J. These stud- 
ies differ in the level of detail included but in general 
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FIG. 1. Characterization of flight delays in the U.S. dur- 
ing 2010. A) Distribution of the delay per flight for arrivals 
and departures. B) Distribution of departure delays sepa- 
rating the flights according to the season: Summer and win- 
ter. C) Delay distribution for flights departing from Atlanta 
Hartsfield- Jackson (ATL), New York John F. Kennedy (JFK) 
and Honolulu (HNL) airports. 
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FIG. 2. In A), difference between the scheduled and real 
Turn Around Time (AT AT) for operations in Atlanta airport, 
ATL, on March 12. In B), distribution of the AT AT per 
flight, separating positive and negative contributions. 



they consider the effects of delays or disruptions in the 
operations of a few major airports (hubs). In this work, 
we take instead a network-wide perspective to analyze 
the performance of a transportation system. We define 
metrics able to quantify the level of spread of the de- 
lays in the network. We then apply these metrics to a 
database with information on the operations in the U.S 
during 2010, and introduce a model that reproduces the 
delay propagation patterns observed in the data. The 
model shows also a notable capacity to evaluate the risk 
of development of system-wide congestion and to assess 
the resilience of daily schedules to service disruptions. 



the scheduled flights for every day of 2010. A detailed 
description can be found in Appendix A. It is important 
to note that this schedule is based on real events, which 
in some occasions may differ from the original planned 
schedule of the companies. If a flight gets canceled, 
diverted or even rescheduled the airline may introduce 
changes in the original schedule that are not possible to 
trace back. However, given that these flights represent, 
respectively, the 0.20% and 1.75% of all flights in the 
database, one can expect these changes not to be of large 
magnitude. 



II. DATABASE 



III. MODEL 



The data was downloaded from the web page of the 
Bureau of Transport Statistics (BTS) [35]. In partic- 
ular, we used the Airline On-Time Performance Data, 
which is built with flight statistics provided by air carri- 
ers that exceed one percent of the annual national rev- 
enue for domestic regular service. The database compre- 
hends 6, 450, 129 scheduled flights operated by 18 carriers 
connecting 305 different commercial airports. The total 
flights operated in the US in 2010, not only those that 
report on-time performance data, sum up 8, 687, 800 [36 . 
Therefore, the database comprises information account- 
ing for 74% of them. The information per flight includes 
real and scheduled departure (arrival) times, origin and 
destination airport, an identification code (tail number) 
for each aircraft, airline, etc. This data enables us to rep- 
resent the US airport network and furthermore replicate 



The modeling approach followed is agent-based at the 
level of aircrafts and is data-driven in the sense that the 
daily schedules and the primary delays are obtained di- 
rectly from real records in the database. This level of 
realism is necessary to confront the model predictions 
with the real unfolding of the delay events during each 
day. Concretely, the model dynamics simulates three 
main subprocesses: aircraft rotation, flight connectivity 
and airport congestion. The latter two are independent 
from each other, and can be turned on/off to explore the 
relevance of each subprocess in leading to network-wide 
congestion. Aircraft rotation, on the other hand, is in- 
trinsic to the schedule and cannot be suppressed. 

The basic time unit of the simulations is one minute, 
every aircraft state is tracked at this temporal resolution. 
We assume that the flights are not able to recover delays 
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FIG. 3. Clusters of congested airports. Maps of the congested airports showing also connections between them for days with: 
A) low, B) intermediate, and C) high level of congestion. The airport color codes are: red, congested airport belonging to the 
largest cluster; orange, congested airport not belonging to the largest cluster; green, airport not congested. Links connecting 
airports in the largest cluster are in red. In D), daily size of the largest cluster as a function of time. In E), complementary 
cumulative distribution of the size of the largest cluster (log-normal scale). And in F), Jaccard index comparing airports 
belonging to the largest clusters in consecutive days or consecutive ranking positions according to the top 20 days with largest 
or lowest average delay. 



on air, and so the departure delays are equal to those 
at arrival to destination. Throughout a day, each air- 
craft follows the connections given in the schedule, the 
so-called plane rotations. The airports are supposed to 
have a capacity per hour proportional to the scheduled 
airport arrival rate with a proportionality factor j3. Fur- 
ther arrivals produce delays. Passengers (crew) of incom- 
ing flights have a certain probability of connecting with 
other flights within a time window of 3 hours from the 
scheduled arrival. The probability of connection is pro- 
portional, with a factor a, to flight connectivity levels 
provided by the BTS for each U.S. airport. A more pre- 
cise description of the model is included in Appendix B. 
This model has, thus, two free parameters: a, controlling 
passenger connectivity, and /?, accounting for airport ca- 
pacity. In the following section, we will examine the effect 
of these parameters on the systemic spread of delays. 



IV. DATA ANALYSIS AND COMPARISON 
WITH MODEL PREDICTIONS 

Flight delays are denned as the difference between the 
scheduled and real departure (arrival) times [21, 24J. Ac- 
tually most of the flights operated in 2010 were on time, 
even some before schedule, but 37.5% of those report- 
ing performance arrived or departed late. Their delays 
do not show a characteristic value: the delay distribu- 



tion displays a broad tail as can be seen in Figure 1A. 
This implies that most flights arrived late by just a few 
minutes, while others were hours behind schedule. The 
shape of the distributions is similar regardless of the ar- 
rival or departure nature of the operations. The planned 
buffer time on ground for each aircraft should help ab- 
sorb part of the delays, specially those mildest as will be 
discussed next, thus altering the shape of the departure 
delay distribution. However, this factor is not able to 
substantially modify the characteristics of the distribu- 
tions. Interestingly, the shape of the delay distribution 
does not change either when the season of the year is con- 
sidered. Summer concentrates the major part of the year 
traffic, so the total delay is higher but when the distribu- 
tion of delay per flight is taken into account both summer 
and winter behave similarly (see Figure IB). The over- 
all distributions of delays are thus quite robust. Some 
small differences can be only observed when focusing on 
particular airports. In Figure 1C, the departure delay 
distribution is plotted for Atlanta, JFK New York and 
Honolulu airports. While the distributions in Atlanta 
and New York are similar, the Honolulu airport shows 
a bias toward larger delays due to its isolation from the 
continent. 

The effect of the buffer time in the airports for absorb- 
ing delays can be measured using the Turn Around Time 
(TAT). The TAT stands for the time spent by an aircraft 
on ground from arrival to departure from the gate. This 
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FIG. 4. Comparison model-reality. Evolution of the largest cluster per hour: A) for the full model, B) the model only with plane 
rotations, C) only with plane rotations and passenger connections and D) only with plane rotations and airport congestion. 
The selected days are the ones with the lowest delay (April 19) and the second day with the largest delay (March 12). 



measure is associated with airport operational efficiency 
and is used to improve the planning of flight connectivity 
and aircraft rotational sequence stability [34]. We refer 
as AT AT to the difference between scheduled and real 
times at the gate. On the one hand, a negative value of 
AT AT means that an aircraft stayed at the gate longer 
than expected and so fresh delay was introduced. On the 
other hand, a positive AT AT shows that the operation 
was quicker than scheduled and that part of the delay was 
recovered. In Figure 2 A, we depict AT AT for each flight 
along a day in the most trafficked airport of the network: 
Hartsfleld- Jackson in Atlata (ATL). That day, March 12, 
happened to be one of the worst in the database in terms 
of average flight delay. The abundance of positive values 
of AT AT is a prove in favor of the capacity of the airport 
to recover delays. The distributions of AT AT for all the 
operations in 2010 separated in positive an negative val- 
ues are displayed in Figure 2B. These distributions, as 
those for the delays, show long tails, which is a marker 
of the complex nature of delay spreading mechanisms. 

The focus so far has been on individual flight delays. 
We define now a metric of congestion for the full network. 
To do so, the average delay of all delayed flights during 
the year is taken as baseline and amounts to 29 minutes. 
An airport is considered as congested whenever the aver- 
age delay of all its departing flights over a certain period 
of time exceeds 29 minutes. Additionally, a daily air- 
port network is built using the flights of the day to assess 



whether congested airports are organized in connected 
clusters or not. Note that being in the same cluster is a 
measure of spatio-temporal correlation of congestion but 
not necessarily a sign of a cause-effect relation. We apply 
the same metric in the simulations in order to compare 
empirical and model results. Maps with the congested 
airports and the connections between them are shown 
for different days of the database in Figures 3A-3C. As 
can be seen, the scenario dramatically changes from day 
to day: in some days a large cluster surges covering 1/3 
of all airports, while in others only one or two airports 
cluster together. This is confirmed when the size of the 
largest connected cluster is depicted as a function of the 
day in Figure 3D. A strong variability is thus the main 
characteristic of the dynamics of the size of the largest 
congested cluster. The cumulative distribution of the 
cluster size is displayed in Figure 3E and it seems com- 
patible with an exponential decay. Even if the fluctua- 
tions are large, there exists a well defined characteristic 
cluster size. Given the cluster variability, an important 
question to answer is whether the congested airports are 
recurrent. In panel 3F, we calculate the Jaccard index 
to compare the sets of airports in the largest cluster in 
consecutive days or for the top 20 worst and best days. 
This index is 1 if the clusters are equal and if they are 
strictly different. Interestingly, the index is relatively low 
for days with large clusters, which implies that the same 
airports are not consistently part of the cluster. 
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FIG. 5. Assessment of the schedule resilience to develop large clusters. In the plots, the size of the largest congested cluster is 
displayed as a function of the fraction of initial delayed flights and of the intensity of the initial delays for a congested March 
12 in B) and D), and for an uncongested day on April 19, A) and C), for two values of the flight connectivity factor a. An 
initial fixed delay is assigned to randomly chosen flights. 



In order to compare empirical results and model pre- 
dictions regarding the evolution of the cluster of con- 
gested airports, we run the model fixing the airport ca- 
pacity parameter /3 = 1 and fitting the flight connectivity 
factor a to obtain a maximum cluster size similar to the 
one observed as the data. By fixing f3 to 1, we are as- 
suming the same airport capacity as originally scheduled. 
The results for the temporal evolution of the congested 
cluster size hour by hour can be seen in Figure 4 for 
March 12 and April 19. Similar plots for other days of the 
year are included in Appendix C. Note that the fit of a is 
essential to get the maximum of these curves, however all 
the cluster size evolution predicted by the model follows 
strikingly well that of the real data. Actually, almost 60% 
of the airports in the real cluster are correctly identified 
by the model since they are top ranking when airports 
are ordered by probability of congestion. Furthermore, 
by fixing a, without any fitting, the model can predict 
with 66% accuracy if a day will develop or not a large con- 
gested cluster (see Appendix C for further details). The 
model allows us also to explore which are the contribu- 
tions of the main three ingredients (plane rotation, flight 
connectivity or airport congestion) to propagate delays. 
From Figures 4B-C, we can conclude that flight connec- 
tivity is the most important factor. One may still wonder 
if the picture changes when the capacity of the airports 
is modified. Actually, the model exhibits weak sensitiv- 
ity to variations on the (3 coefficient as shown in Figure 
19. Slightly increasing the airport capacity will not ease 
off the propagation of delays since the main cause of the 
spreading, flight connections, is independent of it. Con- 



versely, a very strong decrease on the airports' capacity, 
around 50%, is needed to trigger new primary delays that 
later on will spread in a cascading effect. This might be 
the case when generalized severe weather conditions or 
labor conflicts occur. 

The initial delays affect the outcome of the model. In 
the results of Figure 4, we take the primary delays for 
each aircraft from the data as initial conditions for the 
model. Introducing different initial conditions, we can 
assess the resilience of a day schedule to an increase of 
unexpected incidences. This question is explored in Fig- 
ure 5 where a fraction of randomly selected flights are 
delayed. The size of the largest cluster is estimated as 
a function of the fraction of delayed flights and of the 
intensity of the initial delays. For the sake of simplicity, 
we set all the initial delays in the simulation equal to 
a fixed value (delay intensity in Figure 5). The results 
are displayed for the schedules of two days: April 19 and 
March 12, which respectively show a very small and very 
large cluster in the real data. In particular, the average 
flight delay on March 12 was the second largest in 2010. 
The congestion on the worst day of the year, October 
27, can be explained due to extreme meteorological con- 
ditions [37], while on March 12 no major external event 
was reported. Therefore, the network-wide propagation 
of delays in that day was likely caused and driven by in- 
ternal mechanisms of the system. Comparing in Figure 5 
the curves for March 12 and April 9, one notices that the 
surface representing the largest cluster size for March 12 
are displaced toward smaller values of the initial delay 
intensity or fraction of flights with primary delay. This 
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signed as found in data or to randomly selected flights but 
keeping the same values as in the data. 



shows a higher susceptibility of the schedule of this day 
to disruptive perturbations. Another interesting feature 
of the curves of Figure 5 is that, given enough primary 
delays, they show a non-negligible risk of systemic fail- 
ure regardless of the schedule. The curves in Figure 5 for 
different values of a also confirm the relevance of connec- 
tions and crew rotations for the spreading of delays. 



The primary flight delays in a day of real operations do 
not necessarily localize randomly in the network. If the 
causes are bad weather, technical or labor issues are more 
prone to concentrate in a few airports. In Figure 6, this 
issue is explored by comparing the intra-day evolution of 
the cumulative size of the largest congested cluster when 
the initial delays are introduced in the model in two dif- 
ferent ways. The first one is by using the primary delays 
given in the database. The second procedure is by ran- 
domly shuffling the flights affected by the primary delays. 
The values of the real delays in the database are main- 
tained but they are assigned to flights selected at random. 
The comparison of the curves for the two cases with the 
real data shows that random perturbations are way more 
efficient to collapse the system. While airports in general 
have some capacity to recover delays, the random selec- 
tion of delayed flights affect a larger number of them and 
besides concentrate a heavier burden on smaller airports 
which have less capacity to react. This result evinces that 
the method followed for schedule evaluation in Figure 5 
is conservative in the sense that it considers the sched- 
ule under a non favorable scenario for the distribution of 
primary delays. 



In summary, we analyze the spreading of delays in an 
air traffic network. In particular, our results focus on 
the US airport network in 2010 but the concepts and 
techniques employed can be easily extrapolated to the 
analysis of the performance of a generic transport sys- 
tem. We introduce a measure for the level of network- 
wide extension of the delays by defining when an airport 
is considered as congested and studying how congested 
airports form connected clusters in the network. The 
size of the largest congested cluster displays in the data 
a high variability from one day to the next. This fea- 
ture is due to the re-start that the system suffers at the 
end of each day and points toward the relevance of the 
daily schedule to define the delay propagation patterns. 
In addition we introduce a data-driven model able to re- 
produce the delay evolution observed in the data. The 
model includes three main mechanisms to spread delays: 
Plane rotation, flight connections of either passenger or 
crews and airport congestion. The last two processes can 
be modulated at will to understand the role that each 
one of them plays in delay propagation. Our simulations 
evidence that passenger and crew connections is the most 
effective single mechanism to induce network congestion. 
We show how the model can be used to assess the daily 
schedule ability to deal with an increase in the number 
of disruptive events and also study the relevance of pri- 
mary delay localization for the evolution of congestion in 
the network. Furthermore the model offers the possibil- 
ity of evaluating the effects of interventions in the system 
before their real implementation. 

Flight delays represent failures to meet constraints im- 
posed by a daily schedule. Its propagation in the network 
is a paradigmatic example of the way in which a dis- 
tributed transport system moves toward collapse. The 
framework develop in this work is thus of easy exten- 
sion to system with dynamics regulated by predefined 
schedules. Its translation to other airport networks is, of 
course, straightforward, and even though the modeling 
of other transportation systems may require some par- 
ticular details, the applicability of the metrics defined to 
measure network-wide congestion based on clustering is 
universal. 
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FIG. 7. Cumulative distribution of the number of flights and 
number of different connections (degree) for the airports in 
2010. 



Appendix A: Database 
1. Description 

The database was obtained using the information avail- 
able at the Bureau of Transport Statistics [35]. In par- 
ticular, we used the Airline On-Time Performance Data, 
which is built with flight data provided by air carriers 
that exceed one percent of the annual national revenue 
for domestic scheduled service. The database compre- 
hends 6, 450, 129 scheduled domestic flights operated by 
18 carriers connecting 305 different commercial airports. 
Considering all flights in 2010, not only those that re- 
port On-Time Performance Data, the number of sched- 
uled domestic flights totalizes 8, 687, 800 [36] and so our 
dataset includes information for 74% of the total. It is 
worth noting that this schedule is based on real events, 
which is not necessarily the original schedule in hands of 
the companies at the beginning of the day. If a flight 
gets canceled, diverted or even in milder situations the 
managers of an airline can introduce changes in the orig- 
inal schedule that we cannot trace back. However, given 
that these flights represent, respectively, the 0.20% and 
1.75% of all flights in the database, one can expect these 
changes not to be of large magnitude. 

Among the available data fields, we consider the fol- 
lowing as the most relevant for our work: Tail number, 
airline ID, airports of origin and destination, date of the 
flight, scheduled departure and arrival times, real depar- 
ture and arrival times, and whether the flight was can- 
celed or diverted. The tail number is a code that identi- 
fies the aircraft and that allows us to follow it along the 
daily plane rotation. Arrival and departure times (real 
or scheduled) considered refer to when the flight actually 
reaches or departs from the gate, we are not taking tax- 
ing or take-off and landing times as departure (arrival) 
times. We will exclude of the coming analysis diverted 
and canceled flights since they difficult the characteriza- 
tion of the delay propagation and are a small fraction of 
the total operations. 

A network between airports can be built based on the 
data: Airports are the vertices and edges represent di- 
rect flights from one airport to another. Note that this 



is a directed graph and that depends on the time-scale 
of data aggregation. On an annual basis, the resulting 
US air-transportation network comprises 305 commercial 
airports and 2, 318 connections. Figure [7] depicts the 
complementary cumulative distribution of the number of 
flights and different connections for all the airports of the 
network in 2010. Both distributions confirm the presence 
of high heterogeneities in the airport network. The most 
active airports for 2010 are represented in Table [I] show- 
ing that the maximal degree corresponds to Atlanta In- 
ternational Airport (ATL) with 159 different connections. 
For the analysis of the clusters of congested airports, we 
will considered networks aggregated only during one day. 



Airport code 


# edges 


# flights 


ATL 


159 


809,869 


ORD 


147 


608,981 


DFW 


140 


524, 206 


DTW 


128 


314,369 


DEN 


125 


470, 592 


MSP 


116 


246, 245 


IAH 


107 


362,562 


SLC 


94 


246, 245 


MEM 


86 


152,730 


MCO 


83 


241,851 



TABLE I. Major airports according to their degree (number 
of different destinations). 

The average delay per delayed flights (those with pos- 
itive delays) for 2010 is 29 minutes. This same value is 
used to define when an airport is considered congested in 
the main text but it can also be used to define days of 
operational problems or not. Those are respectively days 
whose average delay per delayed flight is over or below 
29 minutes, respectively. Table [II] shows the ranking of 
the 20 best and worst days of the year according to their 
average delay for flights with positive delay. 

The United States spans through several time zones. 
In order to unify criteria and simplify the analysis, we 
transform all the local times to the East Coast local time. 
Olson or tz database [38] is used to ensure an accurately 
timezones conversion from the respective local times in 
the database to the East Coast local time (EST in win- 
ter and EDT in summer time). Plotting the departure 
probability as a function of the scheduled departure in 
Figure [8] we can distinguish a zone that goes from 00am 
to 4am with almost no operations. We set thus the start 
of a new day in our analysis and simulations at 4am East 
Coast local time, that is, at 3am Central, 2am Mountain 
and lam Pacific. In this way, the starting of the new day 
coincides with the low activity phase of air operations in 
most of the country. 

We also notice that for daily networks 98% of the edges 
are bidirectional on average, i.e., if there is a flight from 
A to B there is always a flight from B to A. Taking this 
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Problematic days 



Satisfactory days 



DATE 


Average delay (mins.) 


DATE 


Average delay (mins.) 


Oct, 27 


54.3 


Apr, 19 


16.9 


Mar, 12 


53.0 


Oct, 09 


17.2 


Dec, 12 


51.9 


Nov, 11 


17.3 


Jan, 24 


49.8 


Apr, 14 


17.6 


Feb, 24 


49.1 


Oct, 08 


18.0 


May, 31 


46.8 


Set, 11 


18.4 


May, 21 


45.5 


Apr, 15 


18.4 


May, 14 


44.6 


Oct, 13 


18.5 


Jun, 23 


44.6 


Apr, 17 


18.5 


Jul, 13 


44.3 


Nov, 10 


18.8 


Jun, 24 


42.8 


Nov, 09 


18.9 


Jul, 12 


42.7 


Mar, 06 


19.1 


Jan, 21 


41.5 


Oct, 12 


19.2 


Jul, 29 


41.4 


Mar, 17 


19.3 


Jun, 15 


41.2 


Feb, 28 


19.5 


Jun, 27 


40.5 


Oct, 16 


19.5 


Mar, 20 


40.5 


Apr, 13 


19.5 


Mar, 11 


39.9 


Nov, 26 


19.5 


Aug, 22 


39.7 


Set, 09 


19.6 


Jan, 25 


39.5 


Set, 20 


19.7 



Annual fraction of connecting passengers for 
each US commercial airport 



Another key input for modeling the delay propagation 
over the network is the connection between flights. The 
previous database has no information regarding flight 
connectivity, neither for the crews nor the passengers. 
In order to at least approximate the heterogeneity of the 
airports in this sense, we used the T100 Domestic Market 
(US carriers) and the DB1B Ticket information down- 
loaded from the BTS page [35]. These documents allow 
us to obtain an approximation of the annual fraction of 
connecting passengers for each airport. The information 
of T100 corresponds to the total number of passengers 
who have a flight departing from an airport regardless 
of their real point of origin (Passenger stioo) • On the 
other hand, the database DB1B contains a 10% sample 
of the number of passengers whose itinerary originated in 
each given airport (Passenger sd bib)- So for each air- 
port we can get an approximation of the annual fraction 
of connecting passengers as: 



Passenger stioo — lO.PassengersrjBiB 
Passenger stioo 



(Al) 



TABLE II. Ranking of the 20 worst/best days of the year 2010 
according to their daily average delay for flights with positive 
delay. 



Although our model is based on flight not passenger 
connectivity, we assume that these ratios are related, 
which is always better than assuming arbitrary values, 
with a controlling the intensity of such relation. 



0.08 r 



2 



5 

CO A 

Oh U 

Q 



06 - 
04 - 



0.02 - 



^^^^^^^ 
Local Time 



Airport code 


Fraction of connecting passengers 


ATL 


0.81 


ORD 


0.72 


DFW 


0.75 


DTW 


0.69 


DEN 


0.71 


MSP 


0.68 


IAH 


0.75 


SLC 


0.73 


MEM 


0.81 


MCO 


0.69 



TABLE III. Fraction of connecting passengers for the top ten 
airports in degree. 



FIG. 8. Probability of flight departure as a function of the 
scheduled departure hour. 



Appendix B: Model description 



To simulate the delay propagation, we developed an 
into account, we symmetrized the network to simplify the agent-based model that combines within the same frame- 
cluster analysis. work queuing and a schedule based approach dynamics. 
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1. Overview 

As stated in the main text, one of the purposes of this 
model is to understand how delays propagate and mag- 
nify considering internal operational factors and sched- 
ule. As it will be explained further below, "extrinsic" or 
primary delay is given at the initial steps of the simula- 
tion to the first flight of the day for some aircraft rota- 
tions, and then let this perturbation evolve multiplying or 
diminishing the delay according to the particular struc- 
ture of the system. Concretely, the model dynamics will 
be based on three subprocesses which are: (i) aircraft ro- 
tation, (ii) flight connectivity and (iii) airport congestion. 
The last two are independent from each other, and can be 
turned on/off to explore the relevance of each subprocess 
in the delay propagation dynamics. Aircraft rotation, on 
the other hand, is intrinsic to the schedule and so we do 
not switch it off. 

We use one-minute intervals as the basic time step unit 
in the model and proceed in each simulation until the 
schedule of a selected day is completed (all flights had 
completed their itinerary). In most cases, this means 
slightly more than 1,440 minutes. This time interval al- 
lows the simulation to execute actions at a realistic con- 
current time-scale and is the finest level available in the 
data. As mentioned in the previous section, 4am East 
Coast local time is set as the starting point for airport 
operations and to begin the aircraft rotational sequences. 
By this selection, we ensure that most aircraft rotational 
sequences are sorted correctly and it is the natural choice 
considering the daylight time flow in the United States. 
Also, as mentioned before, to arrange the schedule in a 
real sequential order we converted time data from Local 
operation time to Eastern local time. 



2. Hierarchy of Objects 

a. Aircraft (tail-number) 



airport origin gate to the airport destination gate. The 
latter is defined as the time the aircraft remains parked 
at the airport gate (Figure p)]). 



Wheels on 



Wheels off 



t t I ~ 

Scheduled Scheduled Block-to-block time 

arrival time departure time 



FIG. 9. Turn- around and block-to-block time/phase defini- 
tion. 



Flights are characterized by a tail- number, origin 
airport, destination airport, schedule departure time 
(Tsch.d) and schedule arrival time (T sc h. a )- Block-to- 
block time (Tj>) between two airports is calculated as: 



— Tsch.a 



1 sch.di 



(Bl) 



where j corresponds to destination airport and i to the 
origin one. Another issue worth noting in our model is 
that, in the block-to-block phase we do not allow for delay 
absorption or reduction. This could only be achieved in 
the turn-around phase by means of the difference between 
the actual arrival time of the previous flight leg and the 
scheduled departure time of the next flight leg. 



c. Air carrier (airline id) 

Air carriers are the second level unit in the model. 
Each aircraft has an airline associated via the airline code 
id. Only aircrafts having the same airline id are allowed 
to int eract during the process of flight connectivity (see 
B 3 b for further details) . 



The airplane is the primary fundamental agent of the 
simulation. The number of airplanes that participate in 
the simulation varies with the day considered, but it is 
around 4, 000. Each aircraft is unique and comes iden- 
tified by their tail number. This code allows us to re- 
construct the rotational sequence of the plane during the 
day. This sequence can be subdivided in individual flight 
legs or point-to-point flights. 



Point-to-point flight 



d. Airport 

The airport is an intermediate-level entity located in 
space coordinates, where interactions among aircrafts 
take place. This interaction occurs indirectly through 
the s chedule, flight connections or airport queues (see 
B3c for further details). Each airport is different from 
the others because of their planned capacity and the lo- 
cal aggregation of the schedule. Airports play the role of 
nodes in the transport network. 



This is the basic schedule unit. It is the minimum 
package of information used as an input to relocate an 
aircraft from an origin to a destination airport, meeting 
the planned schedule. During their itinerary an aircraft 
can be in one of two flight phases: block-to-block or turn- 
around phase. The former is the time elapsed from the 



e. Clusters of congested airports 

This is a high-level entity that represents interactions 
between airports. The clusters are formed by airports 
whose average (departure) delay per flight is higher or 



11 



equal to 29 m inutes and are linked by a direct connec- 
tion (see B6 for further details). In most cases, we are 
interested in the largest cluster of the full day (or by 
hour in some cases). The size of a cluster is measured 
according to the number of airports that belong to it. 



Figure 10 shows a representation of two clusters (Clus- 



ter A and B) constituted by airports whose average de- 
parture delay per flight in a certain time period is equal 
or larger than 29 minutes (red dots). Apart from this 
condition airports within these clusters are linked by a 
direct connection. In this case, cluster A correspond to 
the largest cluster in a certain time period according to 
the number of airports that form this cluster. 



Scheduled 
arrival time 



£1 



_ 



Scheduled 
departure time 



_ 



Actual arrival time 
Flight A 



Actual departure time 
Flight A 



Departure delay 



Scheduled turn around time 



FIG. 11. Aircraft rotation description. 



b. Flight connectivity 




FIG. 10. Red dots correspond to airports whose average de- 
parture delay per flight in a certain time period is > than 
29 minutes. Green dots correspond to airports whose average 
departure delay per flight in a certain time period is leq than 
29 minutes. 



3. Subprocesses 

a. Aircraft rotation 

During a day, each aircraft has an itinerary to accom- 
plish that in the vast majority of cases consists of two or 
more flight legs. Naturally, to complete a flight leg, the 
previous ones have to be fulfilled, e.g., it is not possible 
to depart from San Francisco to Honolulu if the airplane 
has not completed the previous leg from Atlanta to San 
Francisco. Besides this evident situation, if an aircraft 
arrives late (inbound delay) and the delay cannot be ab- 
sorbed by the turn-around time it will depart late in the 
next flight leg (Figure [IT]) . Usually, a buffer time is in- 
cluded in the turn-around phase to absorb this type of 
delay but this is already incorporated in the schedule ob- 
tained from the data. 

Another feature of this subprocess, is that in the turn- 
around phase each aircraft, when arrived, has to comply 
with a minimum service time T s , in the simulations set as 
30 minutes. This service time includes operations such as 
refueling, passenger unboarding/boarding, luggage han- 
dling, safety inspection, etc. 



In addition to rotational reactionary delay, the need 
to wait for load, connecting passengers and/or crew from 
another delayed airplane from the same fleet (airline id) 
may cause, as well, reactionary delay. 

Sch. arrival Sch. arr. time. Sch. arr. time. Sch. arr. time. Scheduled departure 



time. Flight B | Flight B | 
Airline X Airline X 



Flight C | 
Airline Y 



Flight D | 
Airline X 



4 4 4 



time 
Flight E | Airline X 



FIG. 12. Possible connections within flights of the same air- 
line. 



For each flight at a particular airport, connections from 
that airport are randomly chosen as follows. Firstly, we 
take a AT window prior to the scheduled departure time 
of the flight. Secondly, we distinguish possible connec- 
tions of the same airline from other flights, that have a 
scheduled arrival time within the AT window (Flights 
B and D in the example of Figure 



12). Finally, from 



these possible connections we select those with probabil- 
ity a * flight connectivity factor. The flight connectivity 
factor was defined in |A 2| and a is an effective parameter 
of control that allows to modify the strength of this ef- 
fect in the model. For instance, a = means that there 
is no connection between flights with different tail num- 
ber, while a = 1 makes the fraction of connecting flights 
of the same airline equal to the fraction of connecting 
passengers in the given airport. In the simulations, a 
is varied according to the case under study and AT is 
always taken to be 180 minutes (3 hours). 

Let us suppose that from the previous example Flight 
D was randomly selected. 

By this subprocess an airplane is able to fly if and if only 
their connections have already arrived to the airport, if 
not it has to wait until this condition is satisfied (Figure 
13). It is important to note that flight connectivity is the 
only source of stochasticity in the model due to a lack of 
knowledge about the real flight connections within the 
schedule. 
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Sch. arrival 
time. Flight D 
| Airline X 



£1 



Scheduled departure 

t j me Actual departure time. 

Flight E | Airline X Flight E | Airline X 
* waiting time 



Actual arrival time. 
Flight D | Airline X 



Departure delay 



FIG. 13. Flight connectivity description. 



c. Airport congestion 



March 12 




Time [EST] 



Because airports are entities with a finite capacity, the 
possibility of their congestion has to be introduced in 
the model. Interactions between aircrafts other than the 
ones defined by the schedule (flight connectivity and air- 
craft rotation) are in this way taken into account. This 
occurs indirectly through an airport's queue. That is to 
say that delays from airplanes of different airlines can 
delay others because they congest the airport. The de- 
lay spreading does not surge so easily as in the previous 
cases, it requires a cumulative effect of several delayed 
aircrafts to perturb the airport efficiency and once this 
condition is meet the delay spread to other aircrafts and 
affect other airlines. 

We assume a "First in-First Served" queuing protocol 
that is the most widely used queue operation and simple 
to introduce in the model. In the simulations each airport 
will have a capacity that varies throughout the day ac- 
cording to the Scheduled Airport Arrival Rate (SAAR). 
This means that for every airport we measure the sched- 
uled flights that arrive per hour and this is the nominal 
capacity for each hour of the day (Figure [l4|) . Due to 
reactionary delays aircrafts may not arrive as planned 
and the Real Airport Arrival Rate (RAAR) will vary. 
Whenever RAAR > SAAR, a queue begins to form with 
the arriving aircrafts. Naturally, airplanes that are not 
in q ueue a re being served and this service time lasts T s 
(see B3a for further details). It should be noticed that 



once an aircraft starts to be served this process cannot be 
interrupted no matter how SAAR varies. We define an- 
other effective control parameter (3 in order to modify the 
nominal capacity of the airports. This parameter multi- 
plies the SAAR and in the simulations presented here 
affects all the airports in the same way. For instance, if 
we want to introduce a buffer capacity of 20%, j3 is set 
to 1.2. 



4. Initial conditions 

Initial condition refers to the situation of the first flight 
of an aircraft sequence, meaning when, where and the 
departure delay of this flight. As mentioned in the main 



FIG. 14. Example of SAAR for three major airports: Atlanta 
International Airport (ATL), O'Hare International Airport 
(ORD) and Denver International Airport (DEN). 



text, variations on this situation can have a great impact 
on the delay propagation. In other words, the dynamics 
of delays over the network is highly sensitive to the initial 
conditions. 

We characterized initial conditions by the average delay 
per flight for the first flights of all the aircraft sequences 
and by the fraction of airplanes that their first flight was 
delayed. Comparing the ranking of the 20 worst and 
best days of 2010 (Figure 15) we can observe that it is 
most likely that if a day started with unfavorable initial 
conditions it will likely produce large congested clusters. 

The simulations can be initialized by two different ways 
depending on the case under study: from data or random 
initial conditions. 



a. From the data 

Initializing the model "from the data" means to repli- 
cate exactly the situation of the first flights of all the 
aircrafts sequences for a particular day. 



b. Random initial conditions 

When random initial conditions are set, initial delays 
are reshuffled among all possible aircrafts, so when and 
where may vary. Two inputs are needed: initial delay 
and fraction of flights initially delayed. For instance, 

Initial delay: 20 minutes 

Percentage of airplanes initially delayed: 10% 
Suppose that the number of aircrafts for one day sim- 
ulation is 4, 000. In this example, 400 aircrafts will have 
their first flight departing with an initial delay of 20 min- 
utes. 
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FIG. 15. Initial conditions of the 20 worst days (red) and of 
the 20 best days (green) of 2010. 



5. Decision Tree 

Model flowchart summary including all the subpro- 
cesses. Flowchart objects in green and red will be ex- 
plained separately. 

• Generate class objects: 
Once the data is loaded for a particular day into 
the data class object, the remaining class objects 
are created using this data structure. These objects 
are: 

— Airport list: Indexation of all the airports that 
operated that day. 

— SAAR matrix: Includes the hourly capacity 
(schedule airport arrival rate) for every airport 
in the list. 

— Airport Flight Connectivity Factor List. 

— Adjacency list: Contains the network struc- 
ture for that day. 

— Tail number: Indexation of all the air crafts 
that operated that day. 




Data class, |_\^ 
Airport list, Saar 

array, Flight 
Connectivity list, 
Adjacency list, Tail 
num list, Queue list, 
Flight situation list 



Load Data 


jfJatabase 






AND J 




r 1 



Flight idx ^ 
Aircraft idx 
Origin airport idx 
Dest airport idx 




Schedule 
index +1 



Delete flight from 
Schedule and 
aircraft from 
Destination Queue 



Aircraft 




departure 






departure delay +1 
min & inbound delay 
+1 min 



departure delay +1 
min & service time 
+1 min 



departure delay +1 
min & queue delay 
+1 min 



departure delay +1 
min & connection 
delay +1 min 



Schedule: For each flight the schedule ob- 
ject contains the information described in sec- 
tion B 2 b , initial delay (see B 4 ), flight index, 
flight status ( on land "L", flying "F" and in 
service or in queue "S"), inbound index (pre- 
vious flight leg index) and connections (see 
B 3 b ) . All flights are initialized with flight 
status "L". 

Tail number situation: For each aircraft con- 
tains the origin airport, the destination air- 
port, the scheduled and actual block-to-block 
time and the departure delay (initial, inbound, 
queue and due to connections). 
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— Airports tail number queue: For each airport 
contains the aircrafts ordered as First in - 
First served. 

— Airports flight queue: The same as the previ- 
ous one but indexed with flight number. 

• Update class objects & t -hi min: 

Objects as Schedule and Airport tail number and 
flight queues are synchronous updated for each time 
step. 

• Aircraft arrival: 

The flight status is changed from "F" to "S" and 
the airport's tail number and flight queues are up- 
dated. 

• Aircraft can depart?: 

The aircraft can depart if the service time (30 min- 
utes) is complete and the are no flight connections 
to wait for. Initial flight legs of an itinerary are 
considered as already served. 

• Aircraft departure: 

Tail number situation and origin airport queues are 
updated. The actual block-to-block time is reset. 
Flight status is changed from "L" to "F". 

• Previous flight legs are not completed? : 

Check if the inbound index is among the flight con- 
nections and the flight status is " L" . 

• Aircraft is in service?: 

Inspect if the flight status for the aircraft is "S" 
and the service time is different from zero or the 
aircraft position at the airport queue is less than 
airport capacity. 

• Aircraft is in airport's queue?: 

Check if the flight status is "S" and the service time 
is zero. 

• Connection/ s not landed?: 

Verifies if the number of connections in the schedule 
for the flight is zero and the flight status is "L". 



If it is so, label the airport with its index 
and insert the airport index in the active 
list. 

Else, label the airport as -2 (not delayed). 
While the active list continue to have air- 
ports to explore: 
o For each airport in the active list: 

* Explore its neighbors in the adja- 
cency list. 

* Check if they are labeled as unex- 
plored and their average delay per 
flight is greater than 29 minutes. 

* If it is so, label them with the same 
index as before and insert the air- 
port index in the active list. 

* Else, label the airport as "not de- 
layed" . 

o Remove from the active list the air- 
ports that their neighbors had been 
explored. 



7. Overview of the model parameters 



Parameter 


October 27 March 12 December 12 July 13 


T s [min] 


30 


AT [min] 


180 


a 


0.263 


0.190 


0.265 


0.075 


p 


1.0 


Initial Condition 


"From the data" 


Parameter 


October 9 


April 19 




T s [min] 


30 


AT [min] 


180 


a 


0.020 


0.020 


13 


1.0 


Initial Condition 


"From the data" 



6. Clustering 

1. Create a cluster list with all airports labeled as -1 
(unexplored). 

2. Create an empty list (active list) to include the air- 
ports to inspect while traversing the adjacency list 
(network). 

3. While unexplored airports continue to exist in the 
cluster list: 

• For each airport in the cluster list: 

— Check if the airport is unexplored and the 
average delay per flight for the airport is 
greater than 29 minutes. 



TABLE IV. Overview of default values of the model's param- 
eters. The values of a correspond to the best fit for the day. 



Appendix C: Model simulations 

1. Model validation and sensitivity to a 

Figure [16| displays results for other days different from 
the ones presented in the text. Results for December 



12 (Figure [16] A), July 13 (Figure [16] B) and October 9 
(Figure 16 C) confirm that the model is in good agree- 
ment with the data when a is fitted for each day. In 
the case of October 27 (Figure KL6] D), the size of the 
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FIG. 16. Evolution of the largest cluster size for A) December 
12 (a = 0.265), B) July 13 (a = 0.075), C) October 9 (a = 
0.002) and D) October 27 (a = 0.263) 
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FIG. 17. Frequency of the largest cluster size for all days of 
2010. 
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cluster evolved much faster than the model prediction, 
although the size could be predicted. Analyzing the pos- 
sible explanation to this difference, we found that severe 
weather conditions occurred that day across an impor- 
tant part of the country [37] affecting flights in airports 
such as Hartsfield- Jackson (Atlanta), John F. Kennedy 
(New York), La Guardia (New York), St. Paul (Min- 
neapolis), O'Hare (Chicago), Philadelphia and Newark. 
External perturbations were not explicitly introduced in 
the model so we cannot expect to be able to reproduce 
well these days delay dynamics. 

In the previous sections we have defined days/airports 
with problems as those whose average delay per delayed 
flight was over 29 minutes. Another way, related to the 
previous one, of classifying the days is by means of the 
largest cluster size of the day. To do so, we set a cluster 
size that corresponds to 15 airports so that if the largest 
cluster size in a day is higher than this threshold the day 
is labeled as problematic or unsatisfactory. On the other 
hand, if it is less than 15 airports the day is labeled as 
satisfactory. This threshold was selected because in the 
distribution of largest cluster size there exists a small 
depression at this value (Figure[l7|). This particular value 
for the threshold is arbitrary. Still, we have repeated the 
analysis with some other thresholds and checked that the 
main conclusions are maintained. 
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FIG. 18. Exploring the model forecast accuracy by varying 
the a parameter. All days of 2010 are taken into account. 



TABLE V. Ranking for the top 20 days by the average de- 
lay for flights with positive delay. Model accuracy according 
to the classification of each day in satisfactory or unsatisfac- 
tory. The model is able to predict unsatisfactory days with 
an accuracy of 70% and satisfactory ones with an 85%. 
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March 12 



Airport Code 


Precentage Realizations 


Accurate Prediction 


ATL 


100.0 


Yes 


CWA 


100.0 


Yes 


DFW 


100.0 


No 


DLH 


100.0 


Yes 


EAU 


100.0 


No 


EYW 


100.0 


Yes 


FLL 


100.0 


Yes 


GGG 


100.0 


No 


MGM 


100.0 


Yes 


MIA 


100.0 


Yes 


ORD 


100.0 


No 


SJT 


100.0 


No 


STT 


100.0 


Yes 


TOL 


100.0 


No 


BHM 


99.8 


Yes 


CAK 


99.5 


Yes 


CHA 


98.6 


Yes 


FAY 


98.4 


Yes 


MEM 


98.3 


Yes 


HSV 


97.9 


No 



different. Even so, this simple method provides us with 
a quantitative framework to validate the model and to 
assess the importance of including further mechanisms 
in the simulation. 

Another accuracy test was done to check if the model 
is able to predict not only the size but the airports that 
comprises the largest cluster of the day. We selected 
March 12 whose largest cluster is formed by 97 airports. 
The model is stochastic, so we run it for 1500 realizations. 
Comparing the data with the model results for the top 97 
airports most frequently appearing in the largest cluster, 
the model accurately identify 57.8% of them. Table VI 
displays the Top 20 airports which are more prevalent 
in the simulations showing if they appeared in the real 
data as part of the largest cluster for March 12. This is 
a first comparison, since the real cluster is coming from 
a single realization in a particular day it cannot be taken 
as a definitive validation of the model. However, an ac- 
curacy of 57.8% with such a simple framework is already 
encouraging. 



2. Analysis of the model sensitivity to changes in f3 



TABLE VI. Top 20 ranking of airports that appear more fre- 
quently in the largest cluster for the model results compared 
to what actually occured on March 12. 

The introduction of the threshold allow us to define 
a binary variable associated to the performance of the 
network each day. Since the model requires a fit in a to 
reproduce the precise dynamics of the congested clusters, 
the aim of this exercise is to set a generic value of a and 
study how many of the satisfactory /unsatisfactory days 
are actually predicted. According to our definition, dur- 
ing 2010, 75% of the days get a satisfactory performance. 
In order to assess the model correspondence with reality, 
we have to take into account that satisfactory days out- 
weigh unsatisfactory ones. Naturally, with a high a the 
model simulations predict unsatisfactory days with high 
accuracy but provide many false positives for satisfac- 
tory days. On the other hand, with a low a, most of 
days with small clusters are successfully predicted but 
not those with large congested clusters. Bearing this in 
mind, we defined the percentage of accuracy as a tradeoff 
between the percentage of accuracy for satisfactory and 
unsatisfactory days. Figure [18] show the fraction of cor- 
rect predictions both for satisfactory and unsatisfactory 
days. Both curves cross at a value of a = 0.087 and at 
an accuracy rate of 65.9%. Obviously, this is a simplis- 
tic technique to measure performance. A more elaborate 
technique should include appropriate economic consider- 
ations to take into account that the cost related to false 
positives, claiming that a day is going to have a large 
congested cluster without actually occurring, and false 
negatives, not being able to predict a major collapse, are 





April, 19 




March, 12 


p 


Average largest 


P 


Average largest 




cluster size [airports] 




cluster size [airports] 


1.0 


1.0 


1.0 


92.0 


0.9 


1.0 


1.1 


92.3 


0.8 


1.0 


1.2 


89.9 


0.7 


1.0 


1.3 


88.7 


0.6 


1.3 


1.4 


86.4 


0.5 


16.8 


1.5 


86.4 



TABLE VII. P variation for April 19 with an a fixed at 0.02: 
airports should work at half the scheduled capacity to trans- 
form this day into an unsatisfactory one. /3 variation for 
March 12 with an a fixed at 0.19: an increase of 50% in 
P decreases the size of the cluster by only a 7%. 

As stated in the main text, the model is able to repro- 
duce the clusters of congested airports by fitting a while 
fixing p to 1. In fact, as shown in Table [VTl| the model 
has a low sensitivity to a variation in the (3 coefficient. 



For April 19 Table \VU\ shows that only by cutting the 
scheduled capacity by half, the day will start to have 
systemic problems according to the size of the largest 
cluster. In the case of March 12 the scheduled airport 
capacity is increased by 50% and the results indicates 
that this increment does not change the overall picture. 



Furthermore, Figure 19 B shows that increasing airports' 
capacity will not ease off the propagation of delays. The 
reason for this is that the main cause of delay spreading, 
flight connections within the schedule, is independent of 
the airport capacity. Conversely, by reducing P by at 
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least 50% can worsen the situation (Figure [l9| A) . Such 
decrease on the airports' capacity can act as a trigger to 
new primary delays (different from the initial ones) that 
later on will spread in a cascading effect due to the flight 
connectivity. Although, a decrease on the scheduled ca- 
pacity of 50% for every airport in the network is not likely 
to occur in practice, a much realistic situation could be 
an airport or group of airports operating undercapacity 
when severe weather conditions are met. 



A) April 19 




Time [EST] 



B) March 12 




Time [EST] 



of flight connections are randomly selected, March 12 will 
continue to display a large cluster. 

March 12 




Time [EST] 

FIG. 20. Exploring the variability of the model results. 

In Figure [20] we can differentiate a growing phase that 
goes from 4am to 5pm and a declining phase from 5pm 
onwards. As already said, merging is critical for the 
size evolution of the clusters. Because in the first hours 
of an unsatisfactory day there are several clusters, thus 
more possible combinations of merging events, the grow- 
ing phase is characterized by a stronger variability than 
the declining phase. The latter, depicts a low variability 
and as Figure [21] A shows the number of clusters do not 
increase during this phase. All in all, this indicates that 
no atomization into smaller clusters is produce when the 
size diminishes. The cluster size dissolves continuously. 



4. Further results on cluster and individual airport 
dynamics 



FIG. 19. Dependence of the hourly largest cluster with aS 
variation in ft. A) April 19 and B) March 12. 



In any case, airport congestion could be the source for 
primary delays but it does not seem to be an important 
force behind their network-wide propagation. 



3. Stochastic variability of the results 

Because of the stochasticity included in the model each 
realization has a slightly different outcome. Figure [20] 
displays the variability between model realizations of the 
results for March 12 considering a confidence interval of 
95%. Simulations in this case were done using initial con- 
ditions "from the data" ; this means that the stochasticity 
is caused only by flight connectivity. No matter which set 



Besides the evolution of the size of the largest cluster 
per hour, dynamics can be characterized by the evolution 
of the number of clusters during the day (see Figure 21). 
While in a satisfactory day (Figure |2T 
ber of clusters varies in each hour wit 



C and D) the num- 
]hout a recognizable 
pattern, in an unsatisfactory day (Figure |2l] A and B) 
the number of clusters increase in the first hours of the 
morning and from then on decay merging into fewer clus- 
ters, in most cases, in the afternoon (eastern time). This 
high-level interaction dynamics between clusters appears 
to be crucial in the evolution of an unsatisfactory day, 
where high-degree nodes play an important role to make 
this merging event come about. 

However events involving individual nodes occur and 
varies dramatically from time to time. Figure [22] displays 
how nodes that belong to the largest cluster of the day 
vary their condition rapidly. One hour they are above 
the 29 minutes threshold and next they recover and vice 
versa. Most nodes switch from one state to the other very 
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FIG. 21. Evolution of the number of clusters. Comparison be- 
tween data and model results for: A) March 12, B) December 
12, C) April 19 and D) October 9. 



A) March 12 

60 



B) December 12 




FIG. 22. Results obtained from the data. Number of airports 
that belongs to the largest cluster of the day for A) March 
12 and B) December 12. Red color indicates the number of 
"old" airports (that their average departure delay per flight 
has been > 29 minutes at least in the previous hour as well), 
while new airports that match this condition are shown in 
orange. The nodes which their average departure delay per 
flight will drop below 29 minutes in the next hour are shown 
in green (if they have been in problem for only one hour) and 
blue (if they have been in problem at least for two hours). 



quickly, although some few nodes repeat their condition 
at least two time steps (red series). 

In order to study the temporal persistence of air- 
ports in the largest congested cluster across the whole 



database, we display in Table [VIII| the list of the top 10 
airports in days in the largest congested cluster and in 
days with problems. Although some airports appear in 
both lists, the order changes and both sets are not ex- 
actly equal. In these lists, there is a strong component of 
airports located in the West Coast. We think that this is 
due to the time difference between East and West Coasts. 
Flight operations initiate before in the East Coast and so 
the delays can propagate Westwards toward the end of 
the day. In the results in the main paper, we show that 
the largest congested clusters are not persistent, at least 
not in more than ~ 50% of the airports between different 



days. The airports in Table VIII are those most persis- 



Airport code 


days in 


Airport code 


days with 




largest cluster 




problems 


ACV 


100 


OTH 


167 


CEC 


80 


CEC 


138 


SFO 


54 


ACV 


136 


OTH 


52 


LMT 


111 


MOD 


49 


MOD 


90 


EWR 


45 


CIC 


86 


CIC 


45 


MFR 


70 


LMT 


44 


BRW 


62 


MFR 


43 


CRW 


60 


CRW 


41 


MLB 


60 



TABLE VIII. Top 10 raking of airports in number of days 
belonging to the largest congested cluster or in number of 
days with problems. 



tent in the largest congested cluster. It is interesting to 
notice that only two major hubs, Newark (EWR) and 
San Francisco (SFO), are present in the top 10 list. 



